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■ Abstract 

The problem of signal recovery from the autocorrelation, or equivalently, the mag- 
nitudes of the Fourier transform, is of paramount importance in various fields of en- 
^ . gineering. In this work, for one-dimensional signals, we give conditions, which when 

satisfied, allow unique recovery from the autocorrelation with very high probability. In 
particular, for sparse signals, we develop two non-iterative recovery algorithms. One 
of them is based on combinatorial analysis, which we prove can recover signals upto 
sparsity o(n^/^) with very high probability, and the other is developed using a convex 
' optimization based framework, which numerical simulations suggest can recover signals 

■ upto sparsity o{n^^^) with very high probability. 
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1 Introduction 

Signal extraction from the autocorrelation, or equivalently, from the magnitude of the Fourier 
Transform is known as phase retrieval. This problem fundamentally arises in many practi- 
cal systems such as X-ray crystallography [1], astronomical imaging [2], channel estimation, 
speech recognition pj etc, and has attracted a considerable amount of attention from re- 
searchers over the last few decades [1]. Various algorithms have been proposed to retrieve 
phase information [5l [6] and a comprehensive survey of them can be found in [3, E]- 

For one-dimensional signals, since the mapping from signals to autocorrelation is not 
one-to-one, unique recovery is not possible in general. For any given Fourier transform 
magnitude, every possible phase corresponds to a different signal. Hence, additional prior 
information on the signal is required to limit the number of valid phase combinations. |9] 
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uses multiple structured illuminations, in which several patterns using different masks are 
collected to guarantee uniqueness. 

We assume that the signal is sparse, i.e., the number of non-zero entries in the signal is 
much less compared to the length of the signal. This constraint greatly limits the number 
of possible phase combinations, and research has been done recently to exploit this feature 
[121 Us] . In many applications of phase retrieval, the signals encountered are naturally sparse. 
For example, astronomical imaging deals with the locations of the stars in the sky, electron 
microscopy deals with the density of electrons, and so on. 

In this work, we prove that signals can be recovered from their autocorrelation with ar- 
bitrarily high probability under certain conditions. We prove this using dimension counting, 
based on the ideas used in [111 |13] for multidimensional signals. We also propose two non- 
iterative recovery algorithms to extract sparse signals from their autocorrelation. Note that 
the phase recovery problem is inherently non-convex, and relaxations similar to the ones 
used in [lOl [131 US] cire used to develop a convex-optimization based framework. 

The paper is organized as follows. In Section 2, we discuss some properties of auto- 
correlation and spectral factorization which we use for signal extraction. In Section 3, we 
prove that signals can be recovered from their autocorrelation with very high probability 
under certain conditions. Non-iterative recovery algorithms are proposed for extraction of 
the signal from their autocorrelation in Section 4. Section 5 presents the simulation results 
and concludes the paper. 



Let X = (xo, xi, ....Xn-i) be a real- valued signal of length n. Its autocorrelation, denoted by 
a = (ao, oi, ....a„_i), is defined as 



j 

where x is the time-reversed version of x. Note that cyclic indexing scheme is used in this 
definition. Rewriting ([Tj) in the ^-domain, we get 



where A{z) and X{z) are the z-transforms of a and x respectively. Since x is real valued, 
X{z) is a polynomial in z with real coefficients and hence its zeros occur in conjugate pairs. 
Also, since A{z) = A{z~^), if zq is a zero of A{z), then z^'^ is also a zero. Hence, the zeros 
of A{z) appear in quadruples of the form (^zq, Zq, Zq^ , Zq*). 

The extraction of x from a, or equivalently X{z) from A{z), is known as spectral fac- 
torization and deals with the distribution of these quadruples between X{z) and X{z~^). 
For every quadruple (^o, Zq, Zq^ , Zq*), we can either assign {zq, Zq) to X{z) and {zq'^,Zq*) to 
X(z^^), or assign {zq'^,Zq*') to X{z) and {zo,Zq) to X(z^^). The total number of different 
valid factorizations hence is exponential in the number of such quadruples. 



2 Theory 
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Lemma 2.1. // two distinct finite-length real-valued signals fi and f2 have the same auto- 
correlation, then there exists finite-length real-valued signals g and h such that 



fi = g^h f2 = g*h (3) 

where h is the time-reversed version of h. 

Proof. Let Fi{z), F2{z), G{z) and H{z) be the 2;-transforms of the signals fi, £2, g and h 
respectively. Since fi and ¥2 have the same autocorrelation, ([2]) gives us 

Aiz) = F,iz)F,iz'') = F2{z)F2{z-^) (4) 

where A{z) is the 2;-transform of the autocorrelation of fi and f2. For every quadruple 
{zq.z^.iZq'^ ,Zq*) which are zeros of A{z), (^Oi-^^o) ^as to be assigned to ^1(2;) or Fi{z~^), 
and F2{z) or ^2(2;"^). Let Pi{z), P2{z) and ^3(2;) be the polynomials constructed from 
such conjugate pairs of zeros which are assigned to {Fi{z) , F2{z)) and {Fi{z), F2{z~^)) and 
{Fi{z-^) , F2{z)) respectively. Note that ^2(2) = Psiz'^)- We have 

Fi{z) = Pi{z)P2{z) (5) 
F2{z) = Pi{z)P3{z) = P^{z)P2{z-^) (6) 
and hence Fi{z) and F2{z) can be written as 

F,{z) = G{z)H{z) F2{z) = G{z)H{z-^) (7) 

where G{z) = Pi{z) and H{z) = P2{z), or equivalently 

fi = g ★ h f2 = g ★ h (8) 

in the time domain as the ^-transform of h is H(z^^). □ 



3 Unique Recovery 

In this section, we establish the fact that within the class of signals with non-uniform support 
(defined later), there is a one-to-one mapping between signals and their autocorrelation 
almost surely. 

Lemma 3.1. If f : A ^ B is a map from A to B, where A is a manifold of dimension da 
and B is a manifold of dimension df,, then the image of f is measure zero in B if da < d^. 

Note that any signal of length n can be represented as a vector in 7^". Let f be a 
finite-length real- valued signal of length n. Let / represent its support, defined as the set of 
locations where the f can have non-zero entries. We say that a signal f has uniform support 
if the indices of the elements belonging to the support are periodic, i.e., in an arithmetic 
progression. The size of the set / denotes the sparsity of f . Let J-^ denote the set of signals 
with sparsity k. Observe that J-^ is a manifold of dimension k. 
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Lemma 3.2. Suppose g and h are finite-length real-valued signals with support set Ig and Ih 
of sparsity kg and kh respectively. If J^gh denotes the set of signals g*h, and Igh its support. 
Then 

(i) The set Tgh is a manifold of dimension kg -\- k^ — I. 

(a) Igh has sparsity kgh > kg -\- kh — 1, with equality iff g and h have uniform support. 

(Hi) If i = g * h, where I, the support of f, is a subset of Igh with sparsity k. The set of 
such f is a manifold of dimension kg -\- kh — 1 — where 7 = kgh — k. 

Proof. We refer the readers to [11] for the proof of (i) and (Hi), (ii) directly follows from 
the properties of convolution. □ 

Lemma 3.3. Suppose f = g * h, with f having non-uniform support where as g and h have 
uniform support, also has the additional property that f has non-uniform support. Then, the 
set of such signals is a manifold of dimension strictly lesser than kg -\- kh — 1 — 'j. 

Proof. The idea of the proof is similar to [11], based on dimension counting. We saw in 
Lemma 13.21 that the set of signals f which can be represented as g * h with sparsity k can be 
written as a manifold of dimension kg + kh — l — 'j. The new set of constraints introduced by 
terms in f being result in a further reduction in dimension. Hence the set of such signals 
belong to a manifold of dimension strictly lesser than A;^ + fc^ — 1 — 7. 

□ 

Theorem 3.1 (Main Theorem). Signals can be uniquely recovered from their autocorrelation, 
or equivalently, from the magnitudes of their Fourier Transforms almost surely iff they have 
non-uniform support. 

Proof. Let J-"^ be the set of all signals f with non-uniform support of sparsity k which have 
another signal f with non-uniform support and same autocorrelation. Note that is the 
set of signals of sparsity k which cannot be recovered uniquely from their autocorrelation. 
Lemma 12.11 showed the existence of signals g and h such that 

f = g * h f = g * h (9) 

From Lemma 13.21 we note that the dimension of J-"^ is less than or equal to kg -\- kh — 1 — 

Case I: kgh > kg + kh-l 

This is the case if g and h do not have uniform support. In this case, the dimension of J^l 
is strictly less than k. Hence from Lemma 13.11 we see that is a set of measure zero in 
J-fc and signals with non-uniform support can be recovered from their autocorrelation almost 
surely. 
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Case II: kgh = kg + kh - I 

In this case, g and h have uniform support. If f and f have non- uniform support, from 
Lemma I3.3[ we see that the dimension of is strictly lesser than k = kg + kh — 1 — j, and 
hence can be uniquely recovered from their autocorrelation almost surely. 

Suppose f or f have uniform support, there will be no additional reduction in dimension. 
This case is equivalent to recovering a one-dimensional signal uniquely with no additional 
constraints, which is almost surely not possible. □ 

4 Recovery Algorithms 

In this section, we develop two non-iterative recovery algorithms for the extraction of sparse 
signals from their autocorrelation. 

4.1 Algorithm 1 

Algorithm 1 is based on combinatorial analysis. We propose a method to recover the support 
of the signal from the support of the autocorrelation, and prove that recovery is possible with 
very high probability if the sparsity of the signal is o{n^^^). Using this support knowledge, 
we show that signals can be recovered from the autocorrelation with very high probability. 

Suppose X is a signal of length n such that each element in x belongs to the support 
with a probability -, where s = n^^a < 1, independent of each other. Let a denote its 

autocorrelation, k denote its sparsity and D = {di,d2, d^} be the set of indices of the 

elements belonging to the support. Also, let dij be defined as \di — dj\ for = {1, 2, ....k}. 
If A is the set of indices of elements belonging to the support of the autocorrelation, then 
A = {{Jijdij}. Note that is a geometric random variable with parameter ^. Without 
loss of generality, let us assume dk-i,k > di2, otherwise we could just flip the signal and 
consider the flipped signal. Define Ai = {dij — di2\dij G A} and A2 = {dij — rffc-i.fcMy ^ 

The algorithm for signal recovery is described below. In what follows, we give a sequence 
of lemmas to justify various steps of the algorithm. 

Algorithm 1 

Input: The autocorrelation a of the signal. 

Output: The sparse signal x which has autocorrelation a 

• Extract di2 and d^-i^k from A (Lemma 14. 4p . Calculate the sets Ai and A2. 

• Perform (^4 fl Ai) fl ((i2,fc-i — H A2)) and identify the support u of x (Lemma l4.5p 

• Construct the graph G (Lemma 14. 6p using u, identify an odd cycle and a path con- 
necting all the vertices and extract x. 
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Lemma 4.1. The sparsity k of the signal satisfies (1 — e)s < k < (l + e)s with very high 
probability for any e > 0, n > n{e). 

Proof. Use Chebyshev's inequality. □ 

Lemma 4.2. For three independent random variables Xi, X2 and X3 where Xi and X2 are 
geometric random variables with parameter ^ , P{Xi — = qX^) < ^ if s = n°',a < 1 
for n > n{e), where p and q are integers. 

Proof. Refer Appendix. □ 
Lemma 4.3. P{dk-i,k - ^12 G ^) < (1 + e)^ for any e > 0, n > n{e). 
Proof. Using union bound, we obtain 

P(4-i,fc - c?i2 e A) < ^ ^ P(4-i,fc - = dij) (10) 

= -P(4-i,fc — di2 = dij) + P{dk~i,k — di2 = dik) 

+ Y.P{dk-i,k-di2 = dij)) (11) 

Note that the dij^s for i 7^ l,j 7^ /c are independent of di2 and 4-1, fc- Hence Lemma 14.21 
can be apphed and each term in the first summation can be upper bounded by ^. Since 
4-1, fc < dik and di2 > 0, all the terms in the second summation are zero. The terms in the 
third summation can be equivalently written as P(4-i,fc ~ 2(ii2 = 4j)) and Lemma [4.21 can 
be used to upper bound every term by -. Since dik is the largest sum, we need not consider 
it in the summation. Hence, we get 

3 3 

P{dk-i,k - 42 eA)< < (1 + e')'- < (1 + e)- (12) 

n n n 

□ 

Lemma 4.4. di2 and 4-i,fc can be recovered from the autocorrelation with very high proba- 
bility if s = o{n^/^). 

Proof. The first and second highest terms in A are dik and 4fc respectively since di2 < 4-i,fc- 
Note that dik — 4fc = du, hence du can be recovered from the autocorrelation. The only 

terms that can be higher than di^k-i in A are {4/=, 4/c, 4-i,A:}- Note that 4fc — dik = 4i, 

which belongs to A for all i = {3, k — 1}. So if 4a: — 4,A:-i doesn't belong to A, we 

can recover di^k-i by considering the highest term which when subtracted from 4a: produces 
a value which doesn't belong to A. The probability of failure can hence be written as 
P{dk-i,k — di2 G A) which goes to zero if s = 0(^3), as seen in Lemma [4.31 Hence both 42 
and 4-1, fc can be recovered with very high probability if s 
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With the knowledge of du and dk-i,k, we can construct the sets Ai and A2. Consider the 
intersection of A and Ai. All entries of the form for i — {3,4, ...k} will survive trivially 
for any signal. Similarly, all entries of the form di^k-i for i — {l....k — 2} will survive the 

intersection of A and A2 for any signal. If we subtract the survivors of the intersection 
of A and A2 from (^2,^-1, "we get d2i for i = {3,4,...k — 1}. Hence the elements d2i for 
i = {3,4, ...k - 1} will survive {A n Ai) n (^2,^-1 - {An A2)). 

Lemma 4.5. No other dij will survive {An Ai) fl ((^2,^-1 — (An ^2)) and hence the support 
can be recovered with very high probability if s = o(n^/'^) 

Proof. Suppose you choose dij such that i and j are picked at random. The probability that 
dij is a particular value can be upper bounded by ^. For a non-trivial dij in A to survive 
APl^i, dij + d\2 has to be in A. Similarly, d2k — d^j and d2,k-\ ~ dij has to be in A for 
it to survive d2k — A[\A2. Using union bounds, we see that the probability of survival of 
some other dij goes to when s — o(n^/^). Note that we have information about dk-i,k upto 
s = o{n^^^). 

If no other elements survive, from d2i for i = {3,4, ...k — 1}, we can extract rfj^j+i for 
i — {3,4, ...k — 2} and since we already know di2 and dk-i,k, we have the support of the 
signal. 

□ 

Suppose we have the support of the signal, D — {di, d2, dk} being the indices of the 

elements belonging to the support. Define a pair {di,dj) as a good pair if they are the only 
pair separated by \di — dj\. Note that for such a pair, a\di-dj\ — Xd^Xdj 

Lemma 4.6. Consider a graph G with k vertices, each vertex representing an element of 
the support. Draw a weighted edge between every good pair, the weight being the value of the 
corresponding autocorrelation. If the graph G has an odd cycle and is connected, then the 
signal can be extracted from the autocorrelation upto a global sign. 

Consider an odd cycle with 2r — 1 vertices ii,i2, .•.«2r-i- The term "^'.^'^ ' ' -I'l gives 

, from which can be extracted upto a sign, and from it the other terms in the odd cycle 
can be extracted using the weight corresponding to the edges. Since the graph is connected, 
all the other terms can be calculated. 

Lemma 4.7. The graph G has an odd cycle and is connected with very high probability for 
s = o(nV3). 

Pick any three vertices randomly. Choose any path of length A; — 3 from one of those 
vertices to cover all the remaining vertices randomly. If all the edges exists between the 
three vertices and the chosen k — 3 length path exists, we are through. If any of the k edges 
doesn't exist, it implies that the distance between that pair of vertices occurs more than 
once. Since there are less than k"^ pairs, the probability of a pair of vertices not having an 
edge can be union bounded by ^. Since there are k edges to be considered, the probabihty 
of failure can be upper bounded by ^. Hence if s = o(n^/^), any chosen triangle and path 
exists with very high probability. 
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4.2 Algorithm 2 



Algorithm 2 is developed using a convex optimization based framework. Semidefinite re- 
laxation is used to convert the non-convex constraints into a set of convex constraints. We 
break the problem into two stages. First, the support of the signal is recovered from the 
autocorrelation and then we solve for the signal in the support. 

4.2.1 Support Recovery 

We have to extract u from the autocorrelation of the signal. We will assume that the support 
of the signal is a subset of the support of the autocorrelation. This is the same as assuming 
there is no cancellation of support in the autocorrelation, which is a very weak requirement 
and holds with probability one if the coefficients of the signal are chosen randomly from a 
non-degenerate distribution. With this assumption, ai — implies that no two elements in 
the support are separated by a distance i, and if is non-zero, there is atleast one pair of 
elements in the support separated by a distance i, i.e., 

Oj = ^ UjUi^j = W j (13) 

cii 7^ ^ UjUi+j 7^ for some j (14) 

where u is the binary support vector. This is clearly non-convex as the constraints are non- 
convex and u is binary. Define S = uu"^, which is allowed to be positive semidefinite, as 
it is the smallest convex set containing all rank one matrices. The entries of S are allowed 
to be in [0, 1], which is the best convex relaxation for binary variables. The trace of S is 
given by = J2i = ^) the sparsity of the signal. Also, note that Sij = J2i ^i^j = 

Uj Ui = kuj = ku'j = kSjj and similarly Sij = kSa. Since fiipped version of the 
support also satisfies all the constraints, a random matrix V is used to bias the cost. The 
support estimation problem becomes 

minimize trace(VS) 

subject to trace{S) — k S ^ 

i j 

^ Si^i+k > iff ttfe 7^ 

i 

0<Sij<l <i,j <m-l 

(15) 

Note that we assume apriori knowledge of the sparsity of the signal, i.e., the number of 
non-zero locations of the signal is known. 
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4.2.2 Signal Recovery 



Note that the autocorrelation constraints are non-convex. As we did in the support ex- 
traction, we use the semidefinite relaxation X = xx^. We append n zeros to the signal so 
that cyclic indexing scheme can be applied, hence a m = 2n order DFT matrix is required. 
Suppose Mk is the m x m matrix defined by M^. = ffcfj, where is the k^^ column of the 
DFT matrix for k = {0, 1, ....m — 1}. The autocorrelation constraints can be written in the 
Fourier domain as 

Yfc = trace(MkX) A; = 0, ,m-l (16) 

where Y = {l^/oPi is the vector containing the squared magnitude of the 

Fourier transform of x. We can solve for the signal using Ll-minimization fi6\ [T7t [18] . 



minimize 
subject to 



l|X||i 

Yk = trace(MkX) 



k = 0, 



.m 



1 



(17) 



X ^ 



if Sij = <i,j <m-l 



5 Simulation Results 
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Figure 1: Success rate of recovery using Algorithm 1 for n = 8192 (n^^^ ^ 20) for various 
sparsities 



Figure 1 shows the success rate of signal recovery using Algorithm 1 as a function of the 
sparsity of the signal. We see that signals with s = o{n^^^) are recovered successfully with 
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Figure 2: Success rate of recovery using Algorithm 2 for n = 64 (n^/^ = 8) for various 
sparsities 

very high probabihty. While the algorithm is computationally very cheap, it is not robust 
to noise due to error propagation. 

Figure 2 demonstrates the performance of Algorithm 2 as a function of the sparsity of the 
signal. Numerical simulations strongly suggest that signals with sparsity upto s = o(n^/^) 
can be recovered using this algorithm. It is also very robust to noise and hence more practical. 
We hope to provide theoretical guarantees in a future publication. 



6 Appendix 

Lemma 6.1. For a pair of geometric random variables Xi and X2 with parameter - each, 
P{Xi — pX2 = c) < - if s = n'^,a < 1 for n > n{e), where p and c are integers. 



6.1 Proof of Lemma 16JJ 

P{X^ -pX2 = c) = J2 Pi^2 = i)P{Xi =pi + c) 



00 



i=0 

00 

{l+p)i 



00 

^-^ n n n n n n ^-^ n' 

i=0 i=0 

1 - (1 - (1 + Ml) " " 
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for n > n{e). 



6.2 Proof of Corollary 11:21 

From Lemma [6 -H we see that 



P{X, - pX2 = qX,) = J2 Pi^s = ^)PiXi - pX, = qi) 



i=0 

oo oo 

^E^(^3 = 0^<^E^(^3 = .)<^ (19) 
i=0 1=0 
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